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Abstract 

This paper is devoted to present and study a specific stochastic epidemic model ac- 
counting for the effect of contact-tracing on the spread of an infectious disease. Precisely, 
one considers here the situation in which individuals identified as infected by the pub- 
lic health detection system may contribute to detecting other infectious individuals by 
providing information related to persons with whom they have had possibly infectious 
contacts. The control strategy, that consists in examining each individual one has been 
able to identify on the basis of the information collected within a certain time period, is 
expected to reinforce efficiently the standard random-screening based detection and slack 
considerably the epidemic. In the novel modelling of the spread of a communicable in- 
fectious disease considered here, the population of interest evolves through demographic, 
infection and detection processes, in a way that its temporal evolution is described by 
a stochastic Markov process, of which the component accounting for the contact-tracing 
feature is assumed to be valued in a space of point measures. For adequate scalings of 
the demographic, infection and detection rates, it is shown to converge to the weak de- 
terministic solution of a PDE system, as a parameter n, interpreted as the population 
size roughly speaking, becomes large. From the perspective of the analysis of infectious 
disease data, this approximation result may serve as a key tool for exploring the asymp- 
totic properties of standard inference methods such as maximum likelihood estimation. 
We state preliminary statistical results in this context. Eventually, relation of the model 
to the available data of the HIV epidemic in Cuba, in which country a contact-tracing 
detection system has been set up since 1986, is investigated and numerical applications 
are carried out. 

Keywords mathematical epidemiology, stochastic SIR model, contact-tracing, measure- 
valued Markov process, HIV, large population approximation, central limit theorem, maximum 
likelihood estimation. 
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1 Introduction 

Since the seminal contribution of the mathematical modelling of epidemiological phenom- 
ena has received increasing attention in the applied mathematics literature. References devoted 
to epidemic modelling or statistical analysis of infectious disease data are much too numerous for 
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being listed in this paper (refer to [U [22] for recent accounts of stochastic epidemic modelhng, 
while deterministic models for the spread of infectious diseases are comprehensively presented 
and discussed in P,[3]). Here, an attempt is made to extend the 'general stochastic epidemic 
model', usually referred to as the standard SIR model, in order to take appropriate account of 
the effect of a contact-tracing control measure on the spread of the epidemic at the population 
level and to acquire a better understanding of the efficiency of this intervention strategy. 

In the area of public health practice, by contact-tracing one means the active detection 
mechanism that consists in asking individuals identified as infected to name persons with whom 
they have had possibly infectious contacts and then, on the basis of the information provided, 
in striving to find those persons in order to propose them a medical examination and a cure in 
the event of infection. Though expensive and controversial, 'contact-tracing' programs are now 
receiving much attention both in the scientific literature (see P ^ITTll^ or [2H] for instance) and 
in the public health community, within which they are generally considered as efficient guidance 
methods for bringing the spread of sexually transmissible diseases (STD's) under control. For 
instance, a contact-tracing detection system has been set up since 1986 for controlling the HIV 
epidemic in Cuba (refer to [TU] for an overview of the evolution of HIV/AIDS in Cuba), which 
shall serve as a running illustration for the concepts and methods studied in the present paper. 
From the perspective of public health guidance practice, mathematical modelling of epidemics 
in presence of a contact-tracing strategy reinforcing a screening-based detection system is a 
crucial stake, insofar as it may help evaluating the impact of this costly control measure. In 
this framework, epidemic models must naturally account for the fact that, once detected, an 
infected person keeps on playing a role in the evolution of the epidemic for a certain time by 
helping towards identification of infectious individuals. 

The primary goal of this paper is to generalize the standard SIR model by incorporating 
a structure by age in the subpopulation of detected individuals, age being here the time since 
which a person has been identified as infected. At any time, the 'R' class is described by a 
point measure, on which the contact-tracing detection rate is supposed to depend. In this man- 
ner, the way an 'R' individual contributes to contact-tracing detection may be made strongly 
dependent on the time since her/his detection through a given weight function ip, allowing 
for great fiexibility in the modelling. Assuming in particular a large population in which the 
infectious disease is spread, properties of the mathematical model are thoroughly investigated 
and preliminary statistical questions are tackled. Beyond stochastic modelling of the contact- 
tracing feature, the present work establishes large population limit results (law of large numbers 
and central limit theorem) for the measure- valued Markov process describing the epidemic (we 
follow in this respect the approach developed in [131 EH EU |35]), as well as in its application 
to statistical analysis of the epidemic. 

The paper is organized as follows. In Section [21 a Markov process with an age-structured 
component is introduced for modelling the temporal evolution of an epidemic in presence of 
contact-tracing. A short qualitative description is provided, aiming at giving an insight into 
how the dynamic is driven by a few key components. The process of interest is the solution of 
a stochastic differential equation (SDE) for which existence and uniqueness results are stated, 
together with a short probabilistic study. The main results of the paper are displayed in Section 
3. Considering a sequence of epidemic models with contact-tracing indexed by a parameter 
n G N* representing the population size roughly speaking, limit results are established when 
n ^ oo. Applications of the latter to the study of maximum likelihood estimators (based 
on complete data) in the context of statistical parametric estimation of the epidemic model 
with contact-tracing are then considered in Section 4. Eventually, in Section 5, these inference 
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techniques are applied for analyzing data related to the HIV epidemic in Cuba and drawing 
preliminary conclusions about the effectiveness of contact-tracing in this particular case: it can 
be seen that the chosen model especially reflects the growing efficiency of the contact-tracing 
detection method, the latter becoming almost as competitive as the random screening based 
method ten years after the beginning of the epidemic. Technical proofs are postponed to the 
Appendix. 

2 The stochastic SIR model with contact-tracing 

Epidemic problems really present a great challenge to probabilists and statisticians. Models 
for the spread of infections are based on hypotheses about such mechanisms as infection and 
detection. The huge diversity of possible hypotheses could give rise to an enormous variety of 
probabilistic models with their specific features. Although comprehensive mathematical mod- 
els should incorporate numerous features to account for real-life situations (such as population 
stratified according to socio-demographic characteristics, time-varying infectivity, effects of la- 
tent period, change in behavior, etc.), we shall deal with a stochastic epidemic model with a 
reasonably simple structure (a modified version of the standard 'Markovian SIR model with 
demography', actually), while covering some important aspects and keeping thus its perti- 
nence from the perspective of practical applications. Indeed, incorporating too many features 
would naturally make the model too difficult to study analytically. As previously mentioned, 
we are mainly concerned here with a probabilistic modelling of the spread of an infectious 
disease in presence of a contact-tracing control strategy in the long-range {i.e. when one can- 
not assume that the epidemic ceases before some demographic changes occur, leading up to 
take into consideration immigration/birth and emigration/death processes). A parcimonious 
Markovian structure for describing these features is stipulated, the main novelty arising from 
the measure-valued component incorporated into the model in order to account for the effects 
of contact-tracing. Beyond its simplicity, our modelling hopefully suffices to shed some light 
on the problem of investigating the efficacy of such a control measure. In the case of the HIV 
epidemic in Cuba for instance (see Section 5), the model obtained accounts for the fact that 
contact-tracing has become as efficient as random screening after 6 years. To our knowledge, 
earlier works have not allowed to construct a model reflecting this phenomenon (see jJjLj and 
the references therein). 

2.1 The population dynamics 

We start with a qualitative description of the population dynamics and a list of all possible 
events through which the population of interest may evolve (see Fig. 1). The population is 
structured into three classes corresponding to the different possible states with respect to the 
infectious disease. We adopt the standard SIR terminology for denoting the current status 
of an individual with the only differences that 'i?' stands here for the population of 'removed 
individuals willing to take part in the contact-tracing program' and that it is structured ac- 
cording to the age of detection, namely the time since a detected individual has been identified 
by the public health detection system as infected. Such a distinction allows for considering 
heterogeneity in the way each 'R' individual contributes to the contact-tracing control. Hence, 
at any time t >0 the class of removed individuals is described by Rt{da) in A1p(M+), the set 
of point measures on R+: for all < Oi < 02 < C)0, the quantity Rt{[ai,a2]) represents the 
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number of removed individuals who have been detected between times t — a2 and t — ai. Here 
and throughout, we use the notation {R,ip) = J ip{a)R{da), R being any positive measure on 
M+ and ip any i?-integrable function. In a more standard fashion, we shall denote by St and It 
the sizes of the classes of susceptible and infectious individuals. 



Ao 



s 


MS, I) 


I 






A3(/,(i?,^)) 



Figure 1: Flow-diagram of the SIR model with contact-tracing detection. 



Individuals immigrate one at a time according to a Poisson process of intensity Aq. Once in the 
population, an individual becomes 'susceptible' and may either leave the population without 
being contaminated (emigration or death) or independently be infected. Emigrations occur in 
the population at time t > with the hazard rate ixoSt and infections with the rate Ai(S't, It)- 
Once infected, an individual can be discovered by the detection system either by random 
screening ('spontaneous detection') or by contact-tracing, or else emigrates/dies. The hazard 
rates associated with these events are respectively A2/t, A3(/t, (i?^, ■?/')), where : M+ — >• M+ 
is a bounded and measurable weight function that determines the contribution of a removed 
individual to the contact-tracing control according to the time a she/he has been detected (see 
the examples discussed below) and fiilt. If detected, an individual takes part in the contact- 
tracing system by providing useful information related to her/his (possibly) infectious contacts. 
We do not consider the emigration/death of detected individuals since it is the availability of 
the information that they have given rather than their presence in the system that plays a role 
in the contact-tracing process. In order to avoid possible misunderstanding due to the notation, 
we underline that Ai(., .) and A3(., .) here denote jump rate functions related to the SIR process 
and not the individual rates (for instance, given the class sizes S and /, the infection rate of a 
given susceptible is Xi{S, I)/S). 

The events through which the sizes St, It and the point measure Rt evolve are numbered as 
follows: 



• Event E = 





recruitment of a susceptible. 


• Event E — 


1 


death/emigration of a susceptible. 


• Event E — 


2 


infection. 


• Event E = 


3 


'spontaneous' detection of an infective, 


• Event E — 


4 


detection of an infective by contact-tracing. 


• Event E — 


5 


death/emigration of an infective. 


Before providing 


a 


description of the population process introduced above via a system of SDEs, 



a few remarks and examples are in order. 
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Examples. (On modelling the contact- tracing feature) .4s previously explained, 
the removed individuals contribute to contact-tracing in function of the time since their detec- 
tions and through the weight function ip. 

1. In the case where the information provided by a detected person enables to examine individ- 
uals at a constant rate over a period of time of fixed length r > immediately after its detection 
(after this time period the information is considered as consumed), the weight function could be 
chosen as 

ij{a) = l{ae[o,r]}, for all a > 0. 

Here we have denoted by 1^: the indicator function of £. The second argument of the contact- 
tracing detection rate, {Rt,ip), is then the number of individuals detected between times t — r 
and t. 

2. The following choice: 

ijj^a) = e"'^-", for all a > 0, (1) 

with c > 0, is of particular interest when assuming that efficiency of the information provided 
by a detected individual (geometrically) decreases as the time a since its detection increases. 

3. To take into account the difficulties one may encounter at the early stages of the search for 
contacts, we can consider functions ip that are increasing from zero before decaying. From this 
viewpoint, a suitable beta or gamma density function would be possibly a reasonable choice of 
parametric weight function ip. 

Remark 1. (Explicit forms for jump rate functions) Until now, no explicit form for 
the infection rate and the detection by contact-tracing rate has been specified for generality's 
sake (it shall be nevertheless assumed that Ai and A3 both fulfill the collection of assumptions HI 
listed below). In practice typical choices for the infection rate function are XiSI, XiSI/{I -\- S) 
or Xil with Xi > (in order to lighten notation, abusively, we shall still denote by Ai and A3 
the parameters characterizing the parametric forms of the rate functions Ai(., .) and Xs{-, ., .) ). 
In the first example, a susceptible becomes infected with an individual rate Xil proportional 
to the number of infected individuals. This rate is generally referred to as the mass action 
principle based model. In contradistinction, the two last examples correspond to a situation 
where the rate at which a given infective makes infectious contact does not increase with the 
size S of the population of susceptibles. Equivalently, the individual rate at which a susceptible 
becomes infected, Xil / S or Ai//(/ + S), decreases when S increases. Such rates are usually 
termed frequency dependent. In a similar fashion, the rate of detection by contact-tracing may 
be chosen as X3l{R, ip) , X3l{R, ip) /{I + {R, ip)) or X3{R, ip) , with A3 > 0. 

Remark 2. (A more general framework) One may consider generalizations of the setup 
described above, stipulating for instance that the 'S' and 'I' classes are stratified according to 
socio- demographic features (or sexual behavior characteristics in the context of STD's) in order 
to account for heterogeneities caused by the social structure of the population, even if it entails 
introducing more duration variables in the model. One may also introduce the 'age of infection'. 
This would enable us to model directly time-varying infectivity, offering this way an alternative 
to so-called 'stage modelling' approaches (see ITSf for instance). The theoretical results of this 
paper may be extended in a straightforward manner to such more general frameworks. In order 
to lighten the notation and make proofs simpler, we restrict the study to the model described 
above. 

Assumptions HI: In the remainder of the paper, the rate functions Ai and A3 are assumed 
to belong to C^(M^), the set of real functions of class on M^. We denote by by dsXi, djXi, 
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diXs and 8^X3 their partial derivatives. We also suppose that all these functions are locally 
Lipschitz continuous and dominated by the mapping {x,x') G ^ xx': for k G {1,3}, we 
assume that VA^ > 0, 3Lk{N) > such that 

V(x, x'), {y, y') G [0, N]\ \K{x, x') - A,(i/, y')\ < Lk{N){\x -y\ + \x' -y'\), 

and that 3Afe > 0, V(x, x') G M^, \k{x,x') < X^xx' (and similarly for the partial derivatives). 
Finally, the weight function i}) is assumed measurable and bounded. Furthermore, for the 
central limit theorem, we will suppose it is of class . 

2.2 On describing the epidemic by stochastic differential equations 

Treading in the steps of [13] who fully developed a microscopic approach for ecological systems 
(see also [351 El] where age-structure is taken into account), we now describe the temporal 
evolution of the epidemic by a measure-valued SDK system driven by Poisson point measures. 

The process {{St, Rt{da))}t>o defined through the SDK system below takes its values in 
N X M X A^p(]R+) and may be seen as a generalization of the classical vector-valued Markov 
processes arising in stochastic SIR models. 

Definition 2.1. Consider a probability space (fi,JF, P), on which are defined: 

1. a random vector {Sq.Iq) with values in (N*)^ such that E[5'o + /o] < +00 (att = 0, we 
assume that no one has been detected yet), 

2. two independent Poisson point measures on M^, Q^{dv, du) and Q\dv, du), with intensity 
dv®du, the Lebesgue measure on M^, and independent from the initial conditions {So, Iq)- 

Define {{St, It, Rt{da))}t>o as the Markov process solution of the following system of SDEs: 

St = So + (lo<«<Ao - lAo<«<Ao+/io5„-+Ai(S„_,/„_)) Q^{dv, du) 

^ It = I0 + L=oIZo'^>^o<u<\o+Xi{h-,s^-)Q^{dv,du) 

- Iv=0 /m=0 '^0<u<i^,^+X2)h^+X■i{h-,{R^-,^(>))Q^{dv, du) 

{Rt, f) = X=o IZo /(0)lo<n<A2/,_+A3{/„-,(fi.-,^»<5^(^^) du) + JZo daf {a)Rv{da)dv , 

(2) 

for all f G C^(]R+) the set of real bounded functions of class with bounded derivatives. We 
have denoted by daf the gradient of f and by g{t—) the left limit m t G M of any cddlag function 
g:R^R. 

Under HI and (i) of Definition 12. 1^ it may be seen that there exists a unique strong (non 
explosive) solution to SDE ([2]). The proof is a slight modification of the proofs of Section 2.2 
in [3l|. 

From a practical perspective, we also emphasize that this approach paves the way for sim- 
ulating trajectories of the epidemic process (see Fig. [2]). An attractive advantage of stochastic 
models in mathematical epidemiology indeed lies in their ability to reproduce certain variability 
features of the observed data. As an illustration, a simulated trajectory of {{It, {Rt, l))}t>o 
generated from Eq. ([2]) with jump rates XiSI and X3l{R,ip) is displayed in Fig. O For 
comparison purpose, the observed cumulative number of detected HIV+ individuals in Cuba 
(1986-2006) has been juxtaposed (see Section [5] for further details). It can be seen that over 
the first 20 years of the epidemic, the simulated and observed curves for the cumulated number 
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Figure 2: Simulations for the Cuban epidemic: simulated evolution of the size of the I class 
(blue) and of the cumulated size of the R class (green). The bold red line stands for the observed 
cumulated size of the R class, computed from data related to the Cuban HIV epidemic over the 
period 1986-2006. We have chosen ip = l[o,4]- In order to mimic the change in trend that can be 
observed between 1995 and 2000, two periods have been separately considered. During the first 
period (i.e. the first fifteen years) the parameters of the simulation have been picked as follows: 
So = 510^ Jo = 230, Ao = 10~^ fio = 10"^ /ii = 6.6 10"^ Ai = 1.14 10"^ A2 = 3.7510-^ 
and A3 = 6.5510"^. In the second period, we have used: Ai = 1.1610^'^, A2 = 4.4510^"*^ and 
A3 = 2.50 10-^ 




of detections are fairly closed to each other, although, between years 7 and 15, the observed 
curve is slightly below the simulated one (it should be noticed that, during these years, less 
funds were available for the management of the contact-tracing detection system because of the 
economic crisis that followed the collapse of the Soviet Union). 

2.3 Limiting behavior in long time asymptotics 

We now state a limit result for the epidemic process introduced above, as time goes to infinity. 
Refer to AI in the Appendix for a proof based on coupling analysis. 

Proposition 2.1. Assume that f{a) as a —> 00. Considering the Markov process 
{{St, It, Rt{da))}t>o introduced in Definition \2.1\ we have, whatever the initial conditions {Sq, Iq) G 
(M*)^, that {St, It, {Rt{da), /)) {Soo, 0, 0) in distribution ast ^ 00, denoting by Soo a Poisson 
random variable of parameter Ao//io- 

The law of Soo is the stationary distribution of the N-valued immigration and death process 
which jumps from k to k + 1 with rate Aq and from k to k — 1 with rate fiok, and is obtained 
in Appendix AI. This ergodicity result shows that the time of extinction of the epidemic is 
almost surely finite, though it may be very long in practice. It is worth mentioning that in the 
situation of long-lasting epidemics, as in the HIV case, the long term behavior of the epidemic 
conditioned upon its non extinction may be refined by studying quasi- stationary measures (see 
[38] for instance). We leave this question (far from trivial when the state space is not finite) 
for further research. 
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3 Large population limits 



The overall purpose of this section is to provide a thorough analysis of the measure- valued SIR 
process introduced above from the 'large population approximation' perspective (one may refer 
to [12] for an account of approximation theorems for Markov processes and to Chapt. 5 in [4J 
for applications of these concepts in approximating vector- valued SIR models), based on the 
recent techniques developed in |13J and [3ll [35] for ecological systems. 

3.1 Renormalization 

We consider a sequence, {{{Sl^\ R["'\da))}t>o, n G N*), of SIR processes with contact- 
tracing. For n > 1, {{S[^\ R^^\da))}t>o corresponds to the stochastic process described 
in Definition 12. 11 starting from {Sl^\ Jq"'*) of size proportional to n and with the following rate 
modifications: the immigration rate is nAo, the infection jump rate function is nXi{S^'^yn, I^^yn), 
while the contact-tracing jump rate function is nX^lI^^^ /n, {R^^\ ip) /n). We denote by {s^^\ if'\r[ 
{S[^'' /n,l[^^ /n,R^^\da)/n) the renormalized process obtained by re- weighting all individuals 
of the population by 1/n. We assume furthermore that (sq^^Zq"'') converges in probability to 
a deterministic couple (so,io) £ as n ^ oo. The moment condition below shall also be 
required in the sequel. Let p > 2. 

Moment assumption Mpi sup^gp^* E[(so"^)p + (4"^)^] < +oo. 

This moment assumption combined with Assumptions HI implies that the moments of order p 
propagate on compact time intervals [0, T] with T > (see [131 El] Section 3.1.2). Before writing 
down the martingale problem associated with {{s["'\ rl"'\da))}t>o for a given n G N*, let us 
give an insight into the way the renormalizations above may be interpreted in some important 
examples: 

Remark 3. (On the meaning of renormalization in basic examples) In the case of 
homogeneous rate functions, the eventual impact of the renormalization on the jump rates may 
be described as follows. 

• With Aq"^ = nAo, the immigration/birth rate is assumed proportional to the initial popu- 
lation size, 

• // the form chosen for Xi{S, I) is either AiJ or XiSI/ {I + S), the renormalized infection 
rate function, Ai/*^"^ or XiS^"'^ 1^^^ {I^"'^ + S^^^) , is not affected by the scaling, while if one 
takes Xi{S,I) = XiSI, the renormalized rate function XiS^'^'^ I^^^ /n decreases proportion- 
ately to 1/n. This reflects the fact that for large scalings (corresponding to large "typical" 
population sizes ) the risk of being contaminated by a given infectious individual is smaller 
that for small scalings. 

• The same remark holds for the contact-tracing rate function X^ll, (i?, ■?/')). 

The next proposition gives a semi-martingale decomposition for {{s^/^\i[^\rl"\da))}t>o, 
which shall play a crucial role in our analysis. 

Proposition 3.1. Let n eW, t>0 and f : (a, n) t-^ fu{a) a function in Cl(Rl.). Under HI 
and the moment condition Mp with p > 2, 
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V / V (4"\/.)-/:_o{(ri"\5./. + 9.A)+A(0)(A2^i") + A3(zl"\(ri^^ / 

(3) 

is a cddldg L"^ -martingale, with predictable quadratic variation given by: 



' (M^'W), = 1 Ao + {^,st^ + Ai(.l"\ zl"))} du 

(M^'W, M^'W), = -i Ai(si"\ du, {M^^^^\ M'-'(")(/))i = 
. (M^.W,M^'W(/)), = -i /to/.(0){A2^i"^ + Ut\ {T^u\mdu. 

This result follows from the representation given in Definition I2.H in which the Poisson 
measures and are introduced. It may be established by following line by line the proof 
of Theorem 5.2 of |T3j and Theorem 3.1.8 of [31], technical details are thus omitted. 

3.2 Main results for the large population limit 
3.2.1 Law of large numbers 

Before stating our first limit result for the sequence of renormalized SIR processes introduced 
above, we make clear the topology we consider. We denote by ^^^7(1^+) the space of finite 
measures on M+, endowed with the metrizable weak convergence topology (see [30]). For all n G 
N*, the sample paths {(4"\ 4"^)}i>o belong to the Skorohod space D(M+,M^ x Mf{^+)) 
equipped with the metrizable Ji topology (see § 2.1 in [20] for further details). 

Heuristically, since the quadratic variation of the martingale process displayed above is of 
order 1/n, one obtains a deterministic limit by letting n tend to infinity. As a matter of fact, 
consider the system of deterministic evolution equations, obtained by equating to zero the 
martingale process in Proposition 13.11 

St = So + /^*=o (-^0 - ^^oSu - Ai(s„, iu)) du 

it = io + /„=o i^^i^u, Q - (/ii + >^2)iu - h{iu, {ru, 4'))) du 

{n, ft) = /„*=o {IZo i^ufia, u) + dafia, u)) r„(da) + /(O, u) {X2iu + hiiu, {ru, i^)))} du 

(4) 

for all / e Cj(M^). The result below states that there exists a unique (smooth) solution to this 
deterministic system, to which the sequence 

{(s("),i("\r^"Hc^a))} 

„>i converges in probability. 

This may be viewed as an extension to our measure- valued setup of the Law of Large Numbers 
stated in Theorem 5.2 of |4j for vector-valued SIR processes in large population asymptotics 
(see the references therein). A sketch of proof stands in Appendix A2. 

Theorem 3.1. (Law of Large Numbers) Under HI and the moment condition Mp withp > 
2, as the size parameter n tends to infinity, the sequence of processes {{s^"'\i^"'\r^"'\da))}nm* 
converges in probability in D(R_|_,]R^ x 7Wi?(M_|_)) to the unique solution {{st, it, rt{da))}t>o of 
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(i) for all t > 0, the measure rt{da) is absolutely continuous with respect to the Lebesgue 
measure. Denoting by pt{a) its density, the map (a,t) ^ Pt{,o) is differentiable on the set 
{a < t} containing its support, 

(a) the map t ^ {st,it) is of class . 

By virtue of the regularity properties mentioned above, (st, it, Pt)t>o also solves the following 
PDE system with initial conditions {so,io,0), denoting the constant function equal to zero: 



dst 
dt 
dit 
dt 

Pt(0) 



Ao - /io^t - Ai(st,2t) 

Ai(st, it) - (/ii + \2)it - A3 (it, /r^ 'ip{a)pt{a)da 
-dapt{a) 

^2it + A3 [it, L ip{a)pt{a)da 



(5) 



This PDE system may be seen as a generalization of deterministic epidemic models intro- 
duced in [11] (see also the references therein), taking into account the effects of the contact- 
tracing strategy and defined through a classical differential system. Besides, we point out 
that the increase of the time since detection ('detection aging') is translated into a transport 
equation (the third equation in ([S])), with a boundary condition for a = (fourth equation in 
(E])). This is a well-known fact in age-structured population models (see [23] for instance). It 
is easy to prove that the solution is of the form pt{a) = pt-a{0). One then recovers a delay- 
differential equation system (DDE), similar as those recently considered in epidemic modelling 
(see O [HI IMl EH I22] and the references therein for instance). In this respect, it should be 
noticed that the DDE system may be classically simplified when the weight function is 
exponential as in Example 2. In this case, it can be replaced by a system of PDEs with finite 
dimensional variables only. 



3.2.2 Central limit theorem 



In order to refine the limit result stated in Theorem 13.11 we establish a central limit theorem 
(CLT), describing how the renormalized epidemic process {s^'^\ i^"\ r^"\da)) fluctuates around 
the solution of 1^. This is an adaptation of the results obtained in Chapter 4 of [31] for 
age-structured birth and death processes. Let T > and consider the sequence of fluctuation 
processes: 



(n) 



( 



in) 
St 
.(n) 



St 



\ 



\y rf'\da) — rt{da) J 
, where A^5'(]R) denotes the space of signed 



(6) 



t e [0, T], n e N*, with values in M x M x Ms 
measures on M equipped with its Borel a-field. 



Functional preliminaries. Since A^5(M) embedded with the weak convergence topology is 
not metrizable, we will in fact consider the sequence {ri^'^'^\da))n&i* as a sequence of processes 
with values in a well-chosen distribution space. In order to prove its tightness, we link this 
distribution space to certain Hilbert spaces. We are inspired by the works of Metivier [26] . 
Meleard [21], and consider the following spaces: 
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Definition 3.1. For g N, 7 e M+, C^'^ is the space of functions f of class such that 
Vfc < f3, \f^^\a)\/{l + |a|^) vanishes as \a\ — * +00, equipped with the norm: 

k<l3 ^^'^ ' ' 

The spaces are Banach spaces and we denote by C their dual spaces. 

Wq'"' is the closure of the space C^(M) of infinitely differentiable functions f with compact 
support in M for the norm \\.\\u/ii.i defined by: 



2 



^ 1 + aPT ^ ' 



k<f3 

The spaces Wq'"' are Hilbert spaces and we denote by W^^''^ their dual spaces. 

In the following, we will be interested in the following spaces (see [H |21] for the continuous 
injections). 

and ^ W^o"^-' ^ ^-^'^ W^o"'"' ^^.5. W^o"''' ^ C""''"- (9) 

The fluctuation processes are now viewed as taking their values in the dual space C"^'^ for 
technical reasons. The space C~^'^ is continuously included in VFq"^'^ which in turn is included 
in VFq~^'^ t)y a Hilbert-Schmidt type embedding. It is in this space that the convergence in 
distribution stated in the next theorem is proved (see Appendix A3 for a detailed sketch of the 
proof). 

Theorem 3.2. (Central Limit Theorem) Suppose that HI and the moment assumption 

Mp withp > 2 are fulfilled and that sup ^|?7q''"^P + |?7o^"^pj < +00. Then, ({rit"'^}teio,T])n>i, 

as a sequence of random variables with values in D([0,T],M x M x Wq^''^), converges in law to 
the unique solution of the following equation: Wt G [0,T], 

'nt = Vo + Wt+ -^{{su^iu^ru), r]u)du, (10) 

Ju=0 

where ^'((s^, r„), 77^) 

ds>^iisu,iu)vt + [di\i{su,iu) + yUi + A2 + di\s{iu, {ru,^))Wu + dnX^ilu, {ru,^)){r]l,^) 
6o[\2 + diX^iiu, (r„, V^))]r/; + dodnX^iiiu, (r„, V^))(r/;, ^) + J*r/;, 

withMt G [0,T], V/ G Wq'\'-^ C2'2), {J;t]IJ) = J^^daf{aXt{da), and where W = {W',W\W' 
is a continuous, centered, square-integrable Gaussian process o/C([0,T],R^ x Wq"^'^). For every 
t G [0, T] and all f G Wq^, the quadratic variation of {W^, W^, {W", f)t)t^[Q'j'] ^-^ given by: 

{W'')t = fQ{\o + lJ'OSu + Xi{su,iu))du, 

{W')t = /o {\i{su,iu) + (/ii + >^2)iu + h{iu, {ru,^)))du 

{W'U))t = jlfm{X2^u + H^uAru.mdu . (11) 

(ly^ w')t = - /o Ai(s„, i^)du, /)), = 

{W\ f)), = - jl f^{0) {X2tu + A3(^„, (r„, ^))) du 
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4 Statistical inference by maximum likelihood estima- 
tion 



We now turn to the problem of estimating the jump rates governing the dynamics of the epi- 
demic, in a parametric setting. Although, generally, not all events related to the epidemic 
are observable in practice, in this premier work we deal with the ideal case where one dispose 
of complete epidemic data by means of maximum likelihood estimation (MLE). Indeed, MLE 
methods for complete data are of interest from a practical viewpoint, insofar as in certain sit- 
uations they may be readily used after implementing adequate augmentation data procedures. 
They constitute, besides, the maximization step of (Monte Carlo-) EM procedures, which are 
extensively used for analyzing infectious disease data (see |5] for instance). However, we stress 
that the question of validly implementing EM-procedures in a continuous-time process setup 
is not without pitfalls (see [32l E] and refer to [IT] for an account of sequential variants of 
the MCEM algorithm, tailored for such a framework). Developing inference methods for the 
present model based on incomplete data (on the incidence process solely, for instance) shall be 
the scope of further research. 

We start with some definitions. Let us fix the renormalization parameter n. We associate 
to the process {s["\il^\rl-^'')t>o the sequence {i?^"'', T^"''}fcgN* where {Ti"^}fcGN* is the sequence 
of successive jump times of the process, and where E^'' G £^ = {0, . . . , 5} is the type of event 
occurring at time T^"'', k > 1 (see Section [2.11) . By convention, the time origin is Tq"^ = 0. 
For notational simplicity only, the rates /io, /^i, Aq, Ai are supposed to be known and we focus 
on the estimation of the detection rates A2 and A3 (extensions to a more general statistical 
framework are straightforward, in particular when estimating the infection rate is the matter). 
We suppose that the latter are entirely determined by a parameter 6, taking values in a set 
e C M^, > 1: A2 = X2{0) and A3(.,.) = A3(.,.,^). We set {Pejeee the resulting family of 
probability measures on the underlying space {Q,A). We denote by P the probabihty measure 
on (fi. A) corresponding to the case when the (i?^"'')'s are i.i.d. and uniformly distributed on 8, 
independent from the durations AT^^"^ = T^""^ — T^^"\, G N, supposed i.i.d. and exponentially 
distributed with mean n/6. 

4.1 The likelihood function 

Let T > and n e W. We denote by k!^^ = ^ 

fc>i -'-{r("'<r} total number of events 
occurring before time T and write the likelihood of {(-E'^"\ T^"'')}^^^^^(n). The complete history 

of the epidemic until time T is described by the a-field J-'^"^ = cr{sl"\ i^\ ri"\ u < t}. With the 
notation above, the statistical model {^l, A, {Pejeee) is dominated along the filtration {^t'^^)t>o 
and P is a dominating probability measure. In particular, for all ^ G 0, we have on JF^"^ that 
Pg = C^^\e) . P with the likelihood: 

cPie) =exp (nT - f (nAo + /ion^"^ + nAi(4"\4")) + {inn + A2(^)n)ii") + nA3(4"\ {r^u\i^) ,G))du 

\ Ju=0 

X nL,(ii;fc,(.g\ig\rg)(da))), (12) 

k=l 
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where: 

l{i5=0} 



Le{E, {s,i,r{da))) = Ao^"^°> (/ios)'<--^> Xi{s,iY^^--^ {X^iOW^^--^^ (r, 7/;), 

If ^* G denotes the 'true value' of the parameter, by taking the logarithm, keeping the terms 
depending on 9 solely and using the representation of Definition 12.11 one is lead to maximize 
the log-likelihood: 



in). 



t=0 Ju=0 



+ l0g(A3(C , (^t^^)'^)l{A,(e-)ne)<«<A.(e-)ne'+nA3(4:!\(r(:!),^>,e-)}J Q\dt,du) 

-n r {Xm't^ + \,{t\{rt\i^),e)}dt. (13) 



4.2 MLE consistency 

Consider the ML estimator for T > and n G N*: 



^„ = argmax4"^(^). (14) 
The following assumptions shall be required: 

Identifiability assumption H2: The map 6 ^ (^2(^)5 -,0)) is injective. 

Regularity assumption Rl: For all (x, y) G M.'^, the maps 9 G Q ^ \2{0) and 6* G 9 i— > 
\3{x,y,9), are equicontinuous. 

As shown by the result below, under the basic identifiability and regularity conditions 
stipulated above, ML estimators are consistent. 

Theorem 4.1. (Consistency of ML Estimators) Set $(x) = log(a;) + 1/a; - 1. Under 
Assumptions HI, Mp with p > 2, H2 and Rl, for allT > and any {6*, 9) G O^, as n cxo, 
we have the following convergence in ¥g* -probability, 

K^{9,9*) = -{1^(9*) - l^^\9)} ^ K{9,9*), (15) 
n 

where: K{9,9*) = C H9*yM^^)dt + C X^l, (r*, ^$( ^5: tl' 



*=o ^ 'HO)' Jt=o ^ U3(^^(rr,0),^) 

denoting by [s* , i* , r* (da)) the solution of the PDF system 1^ with rate functions associated 
with 9*. 

Under the further assumption that the parameter space B is compact, the ML estimator [T^ is 
consistent: 

lim 9n = 9*, in Fg* — probability. (16) 

71— >00 

This result mainly relies on the Law of Large Numbers stated in Theorem 13.11 (see A4 in 
the Appendix for technical details). 
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4.3 MLE asymptotic normality 

In order to refine our study of tfie asymptotic beliavior of tlie ML estimator, we suppose that 
the stronger regularity assumption below is satisfied. 

Regularity condition R2: For all {x,y) G M*^, the maps 6 e & ^ ^2(0) and 6 e & f—>- 
\3{x,y,9) are twice continuously differentiable. 

Let Tieg denote the hessian matrix of any twice differentiable function G O 1— g{d)- 
Observe that the Fisher information matrix is given by: 

Ie = - r ineHOK - 1 ) - V,A2(^^) ^9X2(0) 



u=0 



X3(^:,(r:,^),r) 

X3{^*uAK,^),0) 

- V,A3«, «, ^P),e) V.A3(z:, «, ^P), ^) ^3{|j^r^|;^)2 } du. (17) 

The next limit result then follows from Theorem 13.21 (see Appendix A5). 

Theorem 4.2. (Asymptotic Normality of ML Estimators) Suppose that the assump- 
tions of Theorem \4.1\ are fulfilled with the additional condition R2. Then we have the following 
convergence in distribution under Fg* : 

as n 00, is 

where Iq* is given by [T^ . 

Moreover, ifXg* is invertible, then the ML estimator [14\ ) is asymptotically normal: under 
Fg*, we have the convergence in distribution 

^{e^ - 9*) =^ ^^{0,Ig^), asn^oo. (19) 
4.4 MLE on first simple examples 

Let us consider the important situation where X^li, {r,ip)) has a multiplicative form. We will 
consider the three following models: Vz G M+, Vr G A^i7(M+), 

Model (A): A3(2,(r,V^)) = X,{r,,p), (20) 

Model (B): A3(^,(r,V^)) = A3-4^, (21) 

Model (C): A3(z,(r,V')) = X^{r,,p)i. (22) 

Here 6 = (A2,A3) G 6 C M*^, and the true parameter is denoted 6* = (A2,A3). In this case, 
explicit maximum likelihood estimators can be obtained for A2 and A3. 

We differentiate the log-likelihood with respect to A2 and A3 and obtain score processes that 
we equate to zero. For Model (A) (see Eq. (|20i) ). we have for n G N* and t G [0,T], 



dl 



(n) 



dX2 



(A2, A3) = fZo ilo<«<Ai™(l'^'(^^' - ^ iLo ''^^dv 



t (A2, A3) = /to 1=0 ilA*ni(")<n<A*ni(")+nA*(.("),^>^'(^^' " ^ /to (^^"^ ^) 
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and one gets: 



Ji,=o^^ '^'^ Jo ^Wjdv 

The Fisher information matrix may be easily exphcited in this case: 



^\ with al = ^j\rl,m- 
al J A3 Jo 



Now in the case where the contact-tracing detection rate is of the form fl2ip or fl22l) . the 
MLE estimate and its asymptotic variance remain both unchanged, while we get for Model 
(B): 



lv=0 lu=0 ^A^n4"J<n<A^n4'l)+nA*4"J(r('L\V'>/(4-+(^i- .'^>)'^^*'^^' 



2 
cr. 



1 r 



and for Model (C): 

3 ~ rT-H/ (n) ,\, > CTs - / ^tVt^Vjat. 

n Jq tv [Tv ,ip)av ^3 ^0 



5 Application to HIV data related to the Cuban epi- 
demic (1986-96) 

This section is devoted to briefly present and discuss preliminary numerical results derived from 
the application of the statistical modelling previously described to real data. These data are 
related to the HIV epidemic in Cuba over the period 1986-1996. Our aim is to illustrate the 
practical interest of the theoretical notions considered in this paper. Owing to space limita- 
tions, the statistical issues of validating the parametric model stipulated below for such data 
shall be thoroughly investigated in a forthcoming paper, entirely dedicated to model checking, 
estimation and testing. 

The Cuban HIV epidemic. In prospect of management and analysis of the epidemic, infor- 
mation related to the spread of HIV/AIDS in Cuba has started to be collected and gathered 
(in a now massive and well-documented data repository) since 1986, after the first HIV cases 
were detected (see the site of the World Health Organization [IQ] and refer to [lO] for a detailed 
description of the HIV/AIDS epidemic in Cuba). Each time a person is detected as seropositive 
(HIV+), the following information is reported: date and way of detection, age, gender, area of 
residence, gender of sexual partners in the last two years. Furthermore, from the beginning of 
1986, all detected persons are invited to give names and contact details of their recent sexual 
partners. Theses partners are then traced and a recommendation for HIV testing is made over 
a period of one year after the last sexual contact with the HIV-infected person (see [E] for 
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further details on the Cuban management system of the AIDS epidemic). Based on the infor- 
mation provided, through the interview following HIV detection in the one hand and through 
the contact-tracing system in the other hand, a plausible date of infection for each individual 
is reported in the database by the Health authorities (here, as a first go, we shall use these 
approximate dates as if they were all exact). After being detected, the person receives, either 
in the sanatoria or in the ambulatory system, regular counselling on living with HIV in order 
to prevent the risk of transmitting the retrovirus (defending in this respect the assumption 
that, once detected, a HIV carrier does not belong to the I population any more). Besides, it 
is essential to notice that one may assert that HIV spreads in Cuba by means of sexual trans- 
mission quasi-solely, due to certain distinctive sociological features (see pS]): indeed, since the 
epidemic began, injection drug use and blood transfusion accounted for a negligible number of 
infections, corresponding to very isolated cases. 

ML estimation. A first attempt is now made to fit the three simple parametric SIR models 
with contact-tracing described in Section U3] with the weight function il){a) = exp(— ca) (the 
age a being given in days and the constant c being a parameter that we will choose (see ([I]))). 
From the available data, the trajectory {{s["'\il"'\rl-"\da))}t(z[Q^T] may be reconstructed over 
the 2400 first days after the first detection (more than 6.5 years). Using the results of Section 
4.4, we obtain the numerical results of Table [H We have chosen c = 10~^, c = 10~^ and 
c = 3 10~^, which correspond to various lifelengths of the information given by a detected 
patient to trace her/his infectious non detected partners. 



model 


parameter 


estimated value 


asymptotic std 


log-likelihood 




A2 


9.5710"^ 


4.3410-5 








c= 10" 


2 




(A) 


A3 


3.9010"=^ 


2.3010-^ 


-2115 


(B) 


A3 


4.5010"^ 


2.6710-^ 


-2119 


(C) 


A3 


1.8510"^ 


1.0910-6 


-2117 






c= 10- 


3 




(A) 


A3 


6.5610"^ 


3.8710-5 


-2138 


(B) 


A3 


1.3010-=^ 


7.7210-5 


-2143 


(C) 


A3 


3.11 10'^ 


1.8310-^ 


-2140 






c = 310 


-4 




(A) 


A3 


4.3710"^ 


2.5810-5 


-2144 


(B) 


A3 


1.1010"^ 


6.5410-5 


-2146 


(C) 


A3 


2.0710-6 


1.2210-^ 


-2147 



Table 1: Estimated parameters and asymptotic standard deviations. 

Figure [3] shows the instantaneous detection rates t ^ Aj"''*^"'' and t X^^\i^^^\ (r|"\ tp)) for 
the three models (A), (B) and (C). 

Recall that A2 and its asymptotic variance remain unchanged, whatever the model consid- 
ered (A), (B) or (C). For the instantaneous rate of contact-tracing detection, we notice that 
the curves obtained by the models (A), (B) and (C) are very similar. This suggests that the 
model might be relatively robust to the choice of (provided it has the exponential parametric 
form stipulated here). The curves for t ^ A^"'^^(rJ"\v^) and t ^ \p''\r^^\i,) are very 
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(b) 




(in thick blue line) and 



Figure 3: Evolution of the instantaneous detection rates t 

1 1— > A3"^(4"'', {'rt^\'ip)) (in green (resp. red and black) line for Model (A) (resp. (B) and (C))) 
(a) : c = 10-2, (b) : c = 10'^, (c) : c = 3 10"^ 



close. This is due to the fact that on the considered period, the number of infectious individuals 
remains stable. Compared with Models (A) and (C), the estimated instantaneous detection 
rate in Model (B) is more important in the beginning and less important in the end. This is 
due to the nonlinearity introduced by the denominator in I^T^ and to the fact that the number 
of detected individuals increases with time. The information given by a detected individual 
to trace new HIV+ ones has a smaller effect when the number of individuals is already high 
(certain "networks" have already been discovered then). We also underline that the larger c is 
and the closer the curves for the contact-tracing detection are. 




990 1991 1992 




Time 



Time 



990 1991 1992 1993 



Figure 4: (a): 3-months Moving Average for the number of total detections (thin black line) and 
for the number of detection by contact-tracing (thick blue line), (b): Percentage of detections 
by contact-tracing obtained with the moving averages computed over 1 month (dotted magenta), 
2 months (dash-dot blue), 3 months (dashed red) and 4 months (solid black). 



On Fig. [3], it can be seen that the rate of detection by contact-tracing increases with time in 
each of the three models fl20l) - fl22l) . It seems that, in all cases, the weight of the contact-tracing 
detection increases to the point of almost counterbalancing the one of the alternative way of 
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detection. This somehow corroborates the phenomenon underhned by Fig. HI in which the 
proportion of individuals detected by contact-tracing among all detected individuals computed 
using a moving-window of fixed length is plotted. The graphs displayed in Fig. Hlare model-free 
and show that the ratio of detections by contact-tracing among all detections stabilizes around 
1/2, after having fluctuated during the 6 first years. These years correspond to the "burn-in 
period" needed for the contact-tracing detection system to start being really efficient. 

Of course, a very large number of practical questions related to model fitting and interpreta- 
tion/projection merit further investigation naturally arise, after this premier work. The impact 
of the choice of the weight function ip on the obtained results should be carefully investigated 
for instance. As mentioned above, this shall be the subject of further research, much more 
oriented towards numerical applications and the practical use of the model. 
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Appendix - Technical proofs 



Al Proof of Proposition 12.1 



Observe first that, when the Markov epidemic process [S, I, R{da)) is restricted to the absorbing 
set N X {0} X A^p(M+), the process {St)t>o evolves as an immigration and death process with 
immigration rate Aq and death rate fio^t- This process is classically geometrically positive re- 
current. Hence, it only remains to show that the set N x {0} x A1p(]R_|_) is reached in finite time 
with probability one, no matter what the initial state. This may be straightforwardly estab- 
lished by coupling analysis: it actually suffices to consider a SIR process (S", /', R!{da)) with the 
same dynamics as {S, I , R{da)) except that, when 1 = one suppose that new infectives may 
be recruited from the outside at a strictly positive rate. Then, one may easily see that (S", /') 
is an irreducible (non explosive) Markov process with state space N^. Besides, denoting by F 
its transition rate matrix, it is straightforward to check that the following Foster-Lyapounov's 
drift criterion is satisfied, with test function /(m, I) = m + I: 

J2 T{{m,l),{m',l')){f{m',l')-f{m,l)) < - - {fi^ + X2)l < -cf {m, I) + d, 

(m',«')eN2 

with c = min(/io, fii + A2) and d = Xq. By virtue of Theorem 7.1 in [27J, (S", /') is geometrically 
recurrent. Thus, when starting from (S'o,-Ro); and (5",/) reach N x {0} in a finite time 

T(^So,Rti) with finite exponential moment. 

Computation of 6*00 's distribution. Let us show that 6*00 is a Poisson random variable of 
parameter Ao//io- Set pk ■= P {Soo = k) for all A; G N. We have 



AoPo + /^oPi = 0, and >l, X^pk-i - >^QPk + ^^Q{k + l)pk+i - l^okpk = 0. (23) 
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We obtain from the first equation that pi = XoPo/fJ^o- Assume that we have proved for A; G N 
that: 

Let us prove that ( l24l) holds for k + 1. From ( 123|) we have: 



(25) 

Since the sequence {pk} defines a probabihty measure, we have ^fc^Pfc = 1- This entails that 
Pq = e~^°^^° and the desired result then directly follows from the equation above. 



A2 Proof of Theorem D (Sketch of) 

This result may be derived from careful examination of Theorems 5.3's proof in fi3\ or of The- 
orem 3.2.2's proof in [3l]. Let A^ir(M+) be equipped with the vague convergence topology. Ap- 
plying Aldous, Rebolledo and Roelly criteria (see HEOIES]), the sequence {(^('"\ /("\ }„6n* 
is proved tight. By Prohorov's Theorem, there hence exists a subsequence that converges in 
law to a limiting value {S, I, R) G D(M+, x 7V1^(R+)). This subsequence can be chosen such 

that 1) converges in law to {R, 1) in D(M+,R+). Since the process {(^f\ i?J"^)}t>o 

has jumps of amplitude 1/n, the limiting value is necessarily a continuous process. Using a 
criterion proposed in [2S] , one may prove that a subsequence may again be extracted from the 
previous one, that converges in law to {S, I, R) in ©(R+, x 7V1^(]R_,_)), where A^ir(]R+) is en- 
dowed with the weak convergence topology this time. The evolution equation is then identified 
thanks to the martingale representation provided in Proposition 13.11 The quadratic variation 
vanishes as — > +oo and the moment assumption enables us to take the limit. 

A3 Proof of Theorem D (Sketch of) 

The proof of Theorem 13.21 is an adaptation of the argument developed in [26], [21] and Chapter 
4 of [31]. We refer the reader to these works for hints to a complete proof and give here the 
main steps only. Here and throughout, C{T,N) shall denote a constant, depending on T and 
N only, that will not be necessarily the same at each appearance. 

As previously emphasized, here and throughout the fluctuation process shall be viewed 
as a distribution- valued process. We will deal with the spaces introduced in ([9]). The con- 
tinuous injections which link these spaces are proved in Theorem 5.4 [I]. We consider ([6]) as 
taking its values in the space C~^'^. As the image of C^'^ through the differential operator da is 
included in C^'^, we will be lead to look for estimates in these both spaces. In order to work in 
a Hilbert setting, we consider the continuous injections linking this space with VFq~^'^ (^^ which 
the tightness criterion is proved) and with PVo'^ '^ (subspace of C~^'^ and C~^'^ in which the 
norm of the martingale part of 7]^ is controlled). The continuous embedding C~°'^ VFq"^'^ 
shall also be required in order to control the norm of certain operators of Wq ' . The Hilbert- 
Schmidt embedding Wq^''^ ^ ^o"^'^ is required by the tightness criteria that we use. Finally, 
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the embedding Wq ^'^ C is used to obtain the uniqueness of the hmiting value. 



Fluctuation process. We have the following decomposition: V/ : (a,t) /t(a) in C (M^), 



^1 ^^-^0 



+ Mt\f) + Vr{f), where: 



(")/ 



(26) 























= \fn 

















i,{n) 



where M'^''^"\ M*'*^") and Ml'^"'\ft) have been defined in ([3]) and where: 

{v^(Ai(4"),e^) - Ai(s„,^„)) + (/xi + A2)r7;'(") 
+ v^(A3(^i"), (rW, ^|J)) - A3(z„, (r„, ^)))} 
V'/'^^H/t) = r {/„(0)[A,<'(-) + v^(A3(e\(ri"\^))-A3(^n,(r.,^)))] 
+ «'^"\5a/«(a)+9,/„(a))}dn. 



j,(n) 



u=0 
t 



u=0 



Localization. A difficulty arises from the fact that the size of the population is not a priori 
bounded. In this respect, for any N > 0, consider the stopping time: 



= inf {t > M+, max (.J"), 4"), (rl"\ 1), (rj"), |a|)) > N 
One may easily see that for all > such that 



> max( sup St, sup it, sup (r^, 1), sup (r^, |a|)). 

t€[o,T] te[o,T] te[o,T] te[o,T] 



we have: 



lim P f < r) = 



(27) 



(28) 



(29) 



Moment estimates. It is straightforward that E[sup^g[Q || 77^' || ^.1,3] < C{T,n), for all 

n G N* (see Lemma 4.3.1). Since C{T, n) depends on n, this estimate is not very interesting. 
Its importance lies in setting Wq~^'^ as a reference space. Using ([9]), we can then also consider it 
as a process with values in C~^'^, C~^'^, W^o~^'^' ^o~^'^ ^r C~'''°. Estimates that do not depend 
on n can be obtained by following the proofs of Lemmas 4.4.3, 4.4.4 and 4.4.5 in 
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Lemma 5.1. Let N he fixed as in &^) . Then, 



. \ sup {|r/i 'I + \ri^' '\ + \\r]^ l^-4,i}] 



«,("■) |2 I I i,(")|2 I II »",("•) ||2 



< sup Jll[ sup ||?7i I + |?7i I + ||?7j 



^}] 



<supE[ sup {|r7,^'^"^r + |r7;'("^r + hI'("^||^-.J]<C(Ar,T)<+o^^ (30) 



Proof. The first two inequalities are consequences of tlie continuous injection ©. Tlien, 
follows from the assumed properties of Ai, A3 and ip combined with the definition of ||.||c'-i,2 
and the use of Gronwall's Lemma. □ 

Tightness of the sequence {C{ri^'^^)}nm'' • By using a tightness criterion due to Metivier 
(see Section 2.L5 and Theorem 2.3.2 in |20j and Lemma C in we will prove thanks to the 
preceding moment estimates that: 

Lemma 5.2. The sequence {£(?7'^"^)}„gN* of the laws of the fluctuation processes {?7''"^}„gN*; 
when considered as a sequence o/D([0, T], x W^'^"^), is tight. 

Proof. At first, we have to show that: Vt G [0,T], 



S>(")|2 I |^*,(")|2 I ||^'':(")||2 



supE ir^r'r + i^rr + ii^; 





4,1 



< +00, 



(31) 



where W^^'"^ is a Hilbert space that is embedded in W^'^'^ by a Hilbert-Schmidt embedding. 
(I3TI) is a consequence of Lemma 15. 1[ 

Then, we establish an Aldous type condition for the finite- variation processes {V^''"^}neN*, 
the quadratic variation processes {{M^'^'^'>)}n^^* and {(M*'*^"))}„gN* and the trace processes 
{^M'^'(")^}„eN* of {M^'(")}„eN-, defined for a Hilbert basis {^k)k&i' of W^'^ by: Vn G N*, Vt G 
[0,T], 



(32) 



k>l 



Since Efc>i¥'fc(0) < C (see [21]), ^M'''(")^t is P-almost surely defined. 

Let 5 > and (S'„,T„)„gN. be a family of stopping times such that Sn < T„ < S'„ + 5. We 
have: Vn G N*, V? > 0, 



P 



Wo''' 



w- 



l+P(d"^<T). (33) 



To bound the first term, we bound 



E 



/ 



by 



E 



9,/(a)r/;'(")(rfa)rfM 
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c- 



by noting that V/ G C^'^ daf G C^'^ with \\dj\\c^.2 < 



C2.- 



and that Vs, t e [0, T], Vn G N* 



daf{a)r]:;^^\da)du < 
Using Lemma [5. H we thus obtain: 



\c- 



C2 



p(d"^<T). 



(34) 



Similar computations can be carried out for V^'^"), V''^''\ (M"'('")), (M^'(")) and ^M^'(")^. 

With (1311) and the criterion in Lemma C of |24j is satisfied and Lemma 15^ is proved. □ 

By virtue of Prohorov's theorem, the sequence {C{T]^^^)}n^fq* is relatively compact inP(D([0,T], 
Wq~^'^)) embedded with the weak convergence topology. The proof of Theorem 13.21 is finished 
by showing that there is a unique adherence value. 

Identification of the adherence values. Let r] G D([0,T],M^ x PVq"^'^) such that C{ri) is 
an adherence value of this sequence. In order to simplify notation, denote again by (^7*^"'')nGN* 
a subsequence that converges in law to rj. Since the magnitude of the jumps of r/*^") is of order 
1/n, the limiting process r] is continuous. 

A first difficulty arises from the fact that Lemma 15.11 only deals with the fluctuations local- 
ized by the stopping times which depends on N. 

We start off with studying the tightness and the convergence in law of the martingales 
(M("))„gi^. for which estimates that do not depend on n nor on N can be established (see [M], 
Lemma 4.4.5). We can prove, using the same tightness criterion as above, that (M'^"^)„gN. is 
tight in D([0, T], x Wq"^'^). Let W = {W, W\ W"^) be continuous martingales as in Theorem 
m Ve > 0, V0 G Wq'\ 



P(sup (M'-'(")(0))i-(l^^(0))i 



sup 

te[o,T] 



<-E[ sup 



> e 



n 



n 



< 



+ L;{N)N 
C{N,T,\mc. 



i,(n) I 
Vs I 



I II '~l(")ll 



Cl.2 



n 



}ds] 



e\ n 



-E[ sup 

te[o,TAd"'] 



r,{n) 



|c-M]+P(d"^<T 



By 0291) and by Lemma [5. 11 this gives that ((M^'*^"'^(0)))„£f^* converges in probability and uni- 
formly in t G [0,T] to (W^^(0)), defined in Theorem 13.21 Since sup^gjQ j.] |AMj'^"''(0)| is bounded 
by C I ^fn and hence uniformly integrable, we obtain by applying Theorem 3.12 page 432 of Ja- 
cod and Shiryaev [19] that (M'''(")(0))„gN. converges in law to the continuous square-integrable 
gaussian martingale (0) starting from and with quadratic variation given by ffTTj) . Similar 
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computations can be done for the other terms of the bracket. Then, (M")„gN* converges in law 
in D([0,T],M2 X Wo~^'^) to a process W G C([0,T],R2 x W^^'^) such as in Theorem [O 

To characterize now the hmit value i], we introduce the following functional, for u = 
(^/^z/^^/'■) G ©([0,T],R2 x H^o"^'^), G H^q'^ and t G [0,r]: 

V ^li^) J V / 

/ -i^J'0 + ^sXl{su,^u))K-^IXlisu,^u)K \ 

- / 95Ai(s ) + + A2 + 97-A3(i V'»)'^;-9HA3(i.,(r.,V))«,V'> \du 

V </.(t-«)(A2 + 9/A3(i„,(r,,V»)i^^ + '^(i-«)5i?A3(i„,(r„,V^))«,V) / 

Notice that, in the definition of \E', the density dependence has been "frozen". We can 
show that the process of C([0,T],R2 X W^^'^) defined for every G Wo and t G [0,T] by 

^M/, Ml, M[{(f))^ := ^!{r],(j),t) has the same law as the process W defined in Theorem 13. 2[ 
Indeed, since (?7''"^)ngN* converges in law to the continuous process r/, we have 

V0Gl<'\ lim ^(r/("\0,.) = ^(77,0,.). 

n— >+oo 

We shall now prove that ^!{ri'^'^\ (f), .) has the same limit in law as M*^"^(0). 
From ([6D and ([35]): Vn G N*, V0 G Vro'\ Vt G [0,T], 

0(t- s)5(n,0,s)cis , (36) 
where: 

A{n,u) = 95Ai(s„,2„)r7:'(") + 9,Ai(s„,z„)r/j,'(") - v^{Ai(4"), zi")) - Ai(s„,z„)}, 
Bin,s) = {9,A3(^„,(r„,^))<'(«)-9RA3(z„,(r„,^))(77;'(-),^)} 
- v^{A3(2(r), (ri"),^)) - A3(2„, (r., V^))}. 

We obtain from 

s,(n) 

Ai(sj['\i^"^) - Ai(s„,i„) = Ai(s„ + -^,iu + ^W) - Ai(s„,z„) 

/n \ n 



rl s,{n) i,{n) s,(n) s,(n) i,{n) t,{n) 

and from the Lipschitz properties of 95A1 and 9/Ai that [^(ri,^)! < C(su, 'i„)(|r7u '•"^P+l?]^'*-"^^)/-^ 
and: 



P I sup 

t6[0,T] 







\ C7iVE(sup„,[o.TAC]^]|r^S-(")p + |r^i(")|2 



>H < ^ ^ + P(G<T)(37) 
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which tends to zero as n — oo, by virtue of Lemma [5.11 and of fl29p . We deal with the term 
B{n,s) with similar computations and obtain that B{n, s)ds converges in probability to 
uniformly in t G [0,T]. As a consequence, \&(?7*^"\ 0, .) converges uniformly in t G [0,T] and in 
probability to the same limit as (M''("), M^'("), M'''(")(0)), which shows that the limiting values 
f] satisfy (fTOl). 

In order to complete Theorem I3.2f s proof, we establish that the strong uniqueness property 
holds for fllUI) (for given W and ?7o). For this, we work in the space C([0, T], x C~'^'°). These 
results rely on the use of Gronwall's lemma and on the fact that when the density dependence 
is 'frozen' {s^'^\ i^"'^ or r^^\da) have been replaced by their deterministic limits) the constants 
appearing in the estimates do not depend on the localization in any more. 

Consequently, the adherence value of {C{r]^^^))nQ^* is unique and the sequence t]^"^ converges 
in law in D([0,T],M2 x ly-^-i) to the solution of SDE 



A4 Proof of Theorem 14.11 

Using representation (fT3|) . the convergence (fT5|) directly results from Theorem 13.11 combined 
with the assumed smoothness properties of Ai, A3. Now, by virtue of the identifiability assump- 
tion, the limiting contrast K{6,6*) equals to in the sole case where 6 = 6* and (fTUj) then 
follows from the regularity assumption Rl in a standard fashion (see [T6]). 



A5 Proof of Theorem 14.21 

Observe first that the map 6 G Q t—>- lip\6) is twice different iable, and denoting by G O the 
true value of the parameter, for all 6' G and T > 0, the score V 0l^^\6) equals to 

r VeH6) ^ J 
=oJu=o H^) {o<«<"^2(«*)4-'}^ [at,au) 

^ V,A3(.g,(r£W),g) ^ . , , 

- n[ {V,A2(r)^(r) + V,A3(^(r),(ri«),V^),^)}d« 

Ju=0 

Let us define = -nelP{6). We have 

lu=0 



+ 



4?) = -n {neX2{e)it^ 



T roo 



t=0 Ju=0 



VeX2{0) VeA2(g) .(n). 



{0<u<nA24"'} 



+ 



V,A3(^^\ {rt\i^),0) •* V0Xsit\ ^-^V'),^) 



^ ■^{A2(e*)m(:'<«<A2(e-)m(:'+nA3(i^:',(r(:',V>,e*)} 



Q\dt,du). 



We have the following result. 
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Lemma 5.3. Under the assumptions of Theorem ("JTS), we have: 
(i) Jg?^ — > le* in Fq* -probability, as n ^ oo, 

(a) for all T > 0, the sequence of processes {{n~^^'^'V0l["'\6)}te[o,T], n ^ N*), converges in 
law in ©(R+,M) to the continuous gaussian martingale process with as initial value and 
quadratic variation given by: 



Proof. The first assertion follows from Tlieorem 13. 1[ Tlie second one is a consequence of 
Tlieorem [321 □ 

The argument of the asymptotic normahty results may be classically derived from the lemma 
above (see Chapter 4 in [23] for instance). Technical details are omitted. 
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